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in  Decision  Diagrams  of  Symmetric 
Muitipie-Vaiued  Functions 

Jon  T.  Butler,  Fellow,  IEEE,  David  S.  Herscovici, 
Tsutomu  Sasao,  Fellow,  IEEE,  and 
Robert  J.  Barton  III,  Member,  IEEE 

Abstract — We  derive  the  average  and  worst  case  number  of  nodes  in 
decision  diagrams  of  r-valued  symmetric  functions  of  n  variables.  We 
show  that,  for  large  n,  both  numbers  approach  .  For  binary  decision 

diagrams  (r=  2  ),  we  compute  the  distribution  of  the  number  of 
functions  on  n  variables  with  a  specified  number  of  nodes.  Subclasses 

of  symmetric  functions  appear  as  features  in  this  distribution.  For 

2 

example,  voting  functions  are  noted  as  having  an  average  of 

2 

nodes,  for  large  n,  compared  to  for  general  binary  symmetric 
functions. 

Index  Terms — Decision  diagrams,  BDD,  symmetric  functions,  multiple¬ 
valued  functions,  complexity,  asymptotic  approximation,  average  case. 

-  -f  - 

1  Introduction 

During  the  past  ten  years,  significant  effort  has  been  devoted  to 
the  study  of  the  (ordered)  binary  decision  diagram  (BDD).  With 
origins  that  predate  VLSI  (Akers  [1]),  BDDs  were  set  on  a  firm 
mathematical  basis  when  Bryant  [2]  proved  that,  for  each  switch¬ 
ing  function,  there  exists  a  canonical  BDD.  A  natural  extension  is 
the  multiple-valued  decision  diagram  (MDD),  used  to  represent  a 
multiple-valued  function,/:  R  —>R,  where  R=  jO,  1,  2,  ...,  r  -  1). 

To  construct  the  MDD  for  a  given  function /(Zj,  X2,  ...,  x„),  we 
use  a  root  vertex  to  represent  the  function  itself,  and  attach  r 
children  to  represent /(O,  X2,  ...,  x„),f{l,  X2,  ...,  x„),  and  so  on  up 
to/(j'  -  1,  X2,  ...,  T„).  To  each  of  these  children,  attach  r  children  to 
represent  the  assignments  to  X2,  and  continue  until  all  variables  are 
assigned.  The  leaf  nodes  represent  the  functions  0,  1,  and  so  on  up 
to  r  -  1.  Whenever  some  function  appears  more  than  once  in  the 
graph,  we  merge  all  instances  of  that  function  into  a  single  node. 
We  also  delete  a  node  that  has  identical  children. 

An  important  measure  of  MDD  complexity  is  the  number  of 
nodes.  For  example,  the  first  two  MDDs  in  Fig.  1  are  BDDs  of 
functions  on  nine  variables.  For  a  complete  description  of  this  fig¬ 
ure,  the  reader  is  referred  to  Section  2.  Fig.  la  represents  the  AND 
function,  while  Fig.  lb  represents  the  deBruijn  function,  so  called 
because  it  corresponds  to  a  Hamiltonian  cycle  in  the  deBruijn 

graph  B3  (Fredricksen  [5]  and  Wegener  [10]).  A  deBruijn  function 
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is  a  function  on  n  =  2  +  k  —  2  variables,  represented  by  a  string  CT 

that  contains  one  (and  only  one)  copy  of  every  possible  substring 

of  length  k.  Nodes  in  this  BDD  correspond  to  every  distinct  sub- 

2 

string  of  <T.  The  number  of  such  nodes  is  asymptotic  to  foi"  large 
n,  as  is  suggested  by  Fig.  lb  in  which  the  arrangement  of  nodes  is 
approximately  one  half  a  square  of  size  n  by  n.  This  represents  the 
worst  case. 

In  this  paper,  we  enumerate  nodes  in  BDDs  of  totally  symmet¬ 
ric  functions.  This  is  an  important  subset  of  all  functions,  in  which 
the  function  is  unchanged  by  any  permutation  of  variables.  From 
now  on,  we  will  use  the  term  "symmetric"  to  denote  totally  sym¬ 
metric  functions.  There  are  at  least  four  papers  that  discuss  the 
worst  case  number  of  nodes  in  BDDs  of  symmetric  funchons.  Bryant 
[2]  noted,  in  passing,  that  the  worst  case  complexity  of  BDDs  for 
symmetric  functions  is  0(n  ).  Two  complete  papers  have  been 
devoted  to  the  calculation  of  the  worst  case  number  of  nodes,  the 
first  of  which  is  Ross  et  al.  [8].  In  the  second.  Heap  [6]  presented  a 
correction  to  the  results  in  [8].  Independently,  Sasao  [9]  derived  a 
similar  expression.  The  last  paper  is  an  example  of  the  growing 
number  of  papers  on  MDDs,  e.g..  Miller  [7].  Within  binary,  we 
know  of  only  one  other  paper  devoted  to  the  average  number  of 
nodes  in  a  class  of  switching  functions.  Butler  and  Sasao  [3]  con¬ 
sider  a  special  class  of  threshold  functions  called  Fibonacci  functions. 

An  interesting  and  important  question  is  whether  the  typical 
BDD  of  a  symmetric  function  has  a  complexity  more  representa¬ 
tive  of  the  BDD  in  Fig.  la  or  of  the  BDD  in  Fig.  lb.  We  answer  this 
question,  showing  that  the  typical  BDD  is  more  like  that  of  Fig.  lb. 
Our  results,  however,  go  beyond  this.  We  derive  the  average 
number  of  nodes  in  MDDs  of  r-valued  functions  of  n  variables, 
where  n  is  large.  Since  the  worst  case  number  nodes  for  general 
MDDs  has  not  been  previously  demonstrated,  we  do  this  too, 
showing  that,  in  the  limit  as  n  approaches  infinity,  both  numbers 
are  asymptotically  the  same.  Another  contribution  of  this  paper  is 
the  experimental  results  shown  in  Section  4.  Here,  we  show  the 
actual  distribution  of  BDDs  with  respect  to  the  number  of  nodes 
and  the  number  of  functions.  Important  subclasses  of  the  symmet¬ 
ric  functions,  e.g.,  AND,  OR,  and  other  voting  functions,  are  seen 
as  ridges  in  this  distribution.  Finally,  we  compare  the  general 
symmetric  function  to  voting  functions,  showing  that  the  worst 
case  number  of  nodes  in  BDDs  of  voting  functions  is  one-half  that 
of  general  binary  functions,  while  the  average  number  of  nodes  in 
BDDs  of  voting  functions  is  one-third  that  of  general  binary  sym¬ 
metric  functions. 

2  MDDs  OF  Symmetric  Functions 

A  symmetric  function  f{X)  on  n  binary  variables  can  be  repre¬ 
sented  by  a  binary  string  of  length  n  +  1,  ...  b;  ...  b„,  where  fo, 

is  the  value  of/(X)  when  i  of  the  n  variables  are  1.  For  example, 
binary  string  representations  001,  010,  101,  and  011  correspond 
to  the  AND,  Exclusive  OR,  Exclusive  NOR,  and  OR  function, 
respectively,  on  two  variables.  With  /(X)  represented  as  a  binary 
(n  +  l)-tuple,  Fy  =  bgbi  ...  b;  ...  b„,  any  substring,  fc/s+j  ...  bf  of 
represents  a  function  that  is  a  node  in  the  BDD  of  By.  This  can  be 
seen  in  Fig.  la.  The  root  node  is  labeled  by  0000000001,  which  rep¬ 
resents  the  AND  function  on  nine  variables;  i.e.,  the  function  that 
is  1  iff  all  nine  variables  are  1.  One  of  the  two  successor  nodes  is 
labeled  000000001,  the  AND  of  eight  variables.  Indeed,  there  is  a 
succession  of  nodes  representing  the  AND  function  from  the  root 
down  to  the  terminal  nodes. 

We  can  use  triangles  to  represent  three-valued  symmetric 
functions.  For  example. 
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MODSUM  Function 

(c) 

Fig.  1 .  Examples  of  decision  diagrams  of  symmetric  functions. 


3  0 

9  19 

Number  of  2's  ^  2  o(T) 

o|o  170 

0  12  3 
Number  of  1's 

shows  the  use  of  a  triangle  to  represent  the  MODSUM  function  on 
three  three-valued  variables.  The  circled  1  means  that  the  function 
evaluates  to  1  when  there  are  zero  Os,  two  Is,  and  one  2  among  the 
variables.  Fig.  Ic  shows  the  MDD  of  the  MODSUM  function  of 
three  three-valued  variables.  In  the  case  of  this  example,  the  root 
node  (the  MODSUM  function  itself)  is  represented  by  the  4x4 
triangle  shown  above,  while  all  nodes  at  the  next  level  down  cor¬ 
respond  to  distinct  3x3  subtriangles,  nodes  in  the  next  level  down 
by  distinct  2x2  subtriangles,  and  nodes  in  the  last  level  by  distinct 
1x1  subtriangles;  i.e.,  by  the  three  logic  values.  Extending  this  to 
four  values  results  in  a  symmetric  fimction  representahon  that  is  a 
tetrahedral. 

A  general  representahon  can  be  obtained  as  follows.  Let  P  be  the 
set  of  ordered  parhtions  of  n  into  r  nonnegative  parts.  That  is,  P  con¬ 
sists  of  r-tuples  {tig,  fj],  ...,  where  n,  >  0  represents  the  number 

of  variables  that  take  on  logic  value  i,  and  rig  +  +  ...  +  n^_i  =  n. 

The  number  of  such  partitions  is  the  number  of  ways  of  choosing  n 

^  ^  Y  _ 

objects  from  r  with  repetition  or  ^  .A  symmetric  function/ 

on  n  r-valued  variables  can  be  represented  as  a  mapping  F^:  P  —>  R, 
where  R  =  jO,  1, ...,  r  -  1)  is  the  set  of  logic  values  of/.  The  number 

I'n+r-lj 

of  such  mappings  is  . 

If  ?7  is  a  node  in  the  MDD  of  a  given  symmetric  multiple-valued 

function  /,  then  the  function  associated  with  7J  is  a  symmetric 
function  obtained  from  /  by  assigning  values  to  certain  variables. 
Because  /  is  symmetric,  it  makes  no  difference  which  variables  are 

assigned,  and/^  depends  only  on  the  number  of  variables  to  which 
each  logic  value  is  assigned.  Let  Vj  be  the  number  of  variables  as¬ 
signed  the  logic  value  i,  for  0  <  /  <  r  -  1.  Then,  fy  -i’q), 

-Dj),  •■•,(«, _i  whenever  {{rig-Vg), 

{rig  -  D]),  (n^-i  —  I’r-i))  is  an  ordered  partition,  i.e.,  whenever 

n,  -  Vj  >  0,  for  all  0  <  1  <  r  -  1. 

3  Average  and  Worst  Case  Number  of  Nodes 

We  have 

Theorem  1.  Both  the  average  and  worst  case  number  of  nodes  in  the 
MDDs  of  r-valued  symmetric  functions  are  asymptotic  to  frr  for 
large  n,  where  n  is  the  number  of  variables. 

We  calculate  the  average  number  of  nodes  in  MDDs  of  n-variable, 
r-valued  symmetric  functions  by  calculating  the  expected  number 
of  nodes  at  any  level.  Specifically,  we  show  that,  as  n  approaches 
infinity,  this  number  is  close  to  the  largest  number  possible  num¬ 
ber,  for  all  levels  except  those  near  the  bottom. 

Let  k  index  the  levels  of  the  MDD,  where  fc  =  0  corresponds 
to  the  root  node  and  the  given  function,  k  represents  the  num¬ 
ber  of  variables  that  have  been  assigned  values  so  far  in  the 
MDD.  Therefore,  at  level  fc  =  1,  there  are  at  most  r  nodes  corre¬ 
sponding  to  the  assignment  of  a  value  to  the  first  variable,  and 
at  level  k  =  n,  there  are  at  most  r  nodes,  corresponding  to  the  r 
possible  logic  values  in  the  function.  In  general,  there  are  at  most 
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N  =  nodes  at  level  k,  since  there  can  be  no  more 

nodes  than  there  are  ways  to  choose  k  logic  values  from  r  with 
repetition.  Each  of  these  N  nodes  can  represent  one  of 
fn-k.r-U 

M  =  r  ^  differenf  symmefric  functions.  That  is,  each 

symmetric  function  is  specified  by  a  mapping  from  a  parfition  to 
the  set  |0,  1,  ...,  r  -  1|.  The  number  of  elements  in  the  domain  of 
fhis  mapping  is  r-1  1®''®!  k,  n  -  k  variables 

remain  fo  be  assigned  values,  and  this  corresponds  to  choosing  n 

-  k  values  from  r  with  repetition.  We  can  now  state 

Lemma  2.  The  worst  case  number  of  nodes  in  the  MDDs  of  symmetric 

r 

functions  on  n  r-alued  variables  is  asymptotic  to  frr,for  large  n. 

fk  +  r  -1\ 

Proof.  Because  there  are  at  most  N  =  \  y  _i  nodes  at  level  k, 
the  total  number  of  nodes  in  the  MDD  is  at  most 


This  sum  has  a  simple  form.  We  can  choose  r  of  «  +  r  num¬ 
bered  balls  by  identifying  fhe  largesf  number.  Thaf  is,  if  we 
choose  the  ball  with  the  largest  number  to  be  fc  +  r,  there  are 

ways  to  choose  the  remaining  balls.  Summing 


k  +  r-1 
r-1 


over  0  <  k  <  n  yields  the  sum  shown  above.  However, 
^  is  also  the  number  of  ways  to  choose  r  oi  n  +  r  balls. 
Thus,  the  above  expression  is  equivalent  to 


(”r)- 


But,  for  large  n. 


is  asymptotic  to  . 


The  likelihood  that  any  given  pair  of  nodes  at  level  k  represents 
the  same  function  is  .  Since  there  are  j  pairs  of  nodes,  the 
expected  number  of  matches  is 


and  the  expected  number  of  distinct  nodes  (functions)  is  at  least 
If  we  choose 


fhen  we  have 


^  Kax  =  «  +  T-l-(2r!log,n)'-‘, 


fn-k+r-1] 

M  =  A  ^ 


k  +  r-l\^{^  +  ^-  (n  +  r- 


(r-1)! 


(r-1)! 


When  n>r  (son  +  r  <2n),  we  have 
N  ^  (m  +  r  -  1)*’'“'>  ^ 

Therefore,  the  likelihood  that  no  two  functions  at  level  k  are  the 
same  is  at  least 


2b~^) 

[r-iy.n’-*^  ' 


which  approaches  1  as  «  approaches  infinity.  Thus,  at  level  k,  the 
expected  number  of  differenf  symmefric  functions  is  at  least 


Ic  +  r  - 

r-1  J 


2('-i)  " 


Over  the  complete  MDD  of  a  random  symmefric  function  on  n 
r-valued  variables,  the  expected  number  of  nodes  is  at  least 


Here,  the  sum  in  the  left  term  can  be  replaced  by 


Lemma  2.  Since 


is  asymptotic  to  ^  for  large  n,  the  average 


number  of  nodes  in  r-valued  symmetric  functions  on  n  variables  is 
asymptotic  to  frr  • 


4  Experimental  Results  for  Binary  Decision 
Diagrams 

Fig.  2  shows  the  distribution  of  BDDs  wifh  respect  to  the  number 
of  nodes,  as  compufed  by  a  program  fhat  counfs  the  exact  num¬ 
ber  of  BDDs  (shown  along  fhe  verfical  axis)  of  n-variable  sym¬ 
mefric  functions  (with  n  plotted  along  one  axis  in  the  horizontal 
plane)  that  have  a  given  number  of  nodes  (shown  along  the 
other  axis  in  the  horizontal  plane).  This  is  an  exact  distribution, 
not  a  sampled  one.  One  horizontal  axis  corresponds  to  n,  the 
number  of  variables,  while  fhe  other  corresponds  to  the  number 
of  nodes  in  a  BDD  with  n  variables.  The  vertical  axis  shows  the 

number  of  n-variable  functions  with  the  number  of  nodes  shown. 

2 

The  worst  case  can  be  clearly  seen  as  a  limit  (which  follows  an  -in¬ 
curve)  beyond  which  fhere  are  no  functions.  Also,  the  proximity  of 
the  average  case  to  the  worst  case  is  seen  as  large,  vertical  exten¬ 
sions  near  the  limit. 


0  0 


Fig.  2.  Distributions  of  n-variabie  symmetric  functions  with  respect  to 
the  number  of  nodes. 


Note  that  the  vertical  axis  plots  the  log  of  fhe  number  of 
cases  to  show  detail  of  the  function  over  a  wide  range  of  n.  This 
also  allows  patferns  to  be  discerned  among  the  data.  Specifi¬ 
cally,  ridges  extending  from  the  origin  have  various  slopes  and 
characterize  certain  classes  of  functions.  These  functions  generate 
BDDs  of  minimal  size,  and  the  exact  number  of  nodes  is  easily 
calculafed.  For  example,  the  ridge  labeled  0  ...  0001/1  ...  1110  in 
Fig.  2  corresponds  to  the  AND  function  and  its  complement.  These 
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TABLE  1 

Number  of  Nodes  in  Sample  BDDs  of  Binary  Functions 


Function 

String  representation 

BDD  node  count 

Constant  0 

00  ...  0 

1 

AND  (n  of  n  voting  function) 

00  ...  01 

n  +  2 

n-  1  of  n  voting  function 

00  ...  011 

2n 

n  -  2  of  n  voting  function 

00  ...  0111 

3n-4 

V  W 

0  1  (wot  n  voting  function) 

00  ...  011  ...1  (vOs,  wts) 

VIV+  2 

OR  (1  of  n  voting  function) 

011  ...  1 

n  +  2 

Constant  1 

11  ...  1 

1 

Parity 

0101  ...  01 

2n+  1 

functions  are  represented  by  BDDs  with  n  +  2  nodes.  Other  ridges 
in  the  figure  correspond  to  symmetric  functions  with  the  string 
representation  (v  Os  followed  by  w  Is,  where  v  +  w  =  n  +  1). 
These  are  the  voting  functions,  which  are  1  if  and  only  if  at  least  w 
of  n  variables  are  1 .  In  the  BDD  of  such  a  function,  a  node  can  be 
labeled  by  i  Os  followed  by  j  Is,  where  1  <  i  <  v  and  1  <  j  <  w,  or 
with  a  0  or  a  1,  for  a  total  of  vw  +  2  nodes.  Table  1  gives  the  size 
of  the  BDDs  of  voting  functions  and  of  the  parity  function  on  n 
variables. 

It  is  interesting  to  compare  our  results  above  with  the  special 

case  of  binary  voting  functions.  It  can  be  shown  that  the  average 

2 

number  of  nodes  in  n-variable  voting  functions  is  (see  Butler 
et  al.  [4]),  for  large  n,  or  one-third  that  of  n-variable  symmetric 
functions.  Further,  the  worst  case  number  of  nodes  in  n-variable 
2 

voting  functions  is  for  large  n,  or  one-half  that  of  n-variable 
symmetric  functions. 
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